Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Voice recognition software accuracy with second language speakers of English

Identifieur interne : 001E69 ( Main/Exploration ); précédent : 001E68; suivant : 001E70

Voice recognition software accuracy with second language speakers of English

Auteurs : D. Coniam [Hong Kong]

Source :

RBID : ISTEX:9E60D439B771A434534991EDCEAEA54E8210303B

Abstract

This paper explores the potential of the use of voice recognition technology with second language speakers of English. The study is a development of an earlier study conducted with a small group of native speakers (Coniam, 1998a, TEXT Technology 8.). The current study involves the analysis of the output produced by a small group of very competent second language subjects reading a text into the voice recognition software Dragon Systems ‘Dragon Naturally Speaking’. As the program is speaker-dependent and has to be trained to recognise each person's voice, subjects first spent about 45 minutes reading a training text of some 3800 words. As the test text, they then read a second text consisting of 1050 words. The output produced by the software was analysed in terms of words, sub-clausal units, clauses and t-units. In terms of accuracy, the second language speakers' output on each category of analysis was significantly lower than that achieved by the native speakers. Nonetheless, the results were consistent in line with the native speakers' scores; i.e. that the highest accuracy scores were achieved at the lowest (and most discrete) level of analysis, the word level, and the lowest scores at the t-unit, or sentence level of analysis. The paper concludes that voice recognition technology is still an at early stage of development in terms of accuracy and single-speaker dependency. Nonetheless, the fact that consistent results have emerged suggests that the development of an assessment tool, such as a reading aloud test via voice recognition technology and determining a score through an analysis of the output, may be a testing procedure with potential.

Url:
DOI: 10.1016/S0346-251X(98)00049-9


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Voice recognition software accuracy with second language speakers of English</title>
<author>
<name sortKey="Coniam, D" sort="Coniam, D" uniqKey="Coniam D" first="D." last="Coniam">D. Coniam</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:9E60D439B771A434534991EDCEAEA54E8210303B</idno>
<date when="1999" year="1999">1999</date>
<idno type="doi">10.1016/S0346-251X(98)00049-9</idno>
<idno type="url">https://api.istex.fr/document/9E60D439B771A434534991EDCEAEA54E8210303B/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001226</idno>
<idno type="wicri:Area/Istex/Curation">001152</idno>
<idno type="wicri:Area/Istex/Checkpoint">001420</idno>
<idno type="wicri:doubleKey">0346-251X:1999:Coniam D:voice:recognition:software</idno>
<idno type="wicri:Area/Main/Merge">001F78</idno>
<idno type="wicri:Area/Main/Curation">001E69</idno>
<idno type="wicri:Area/Main/Exploration">001E69</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Voice recognition software accuracy with second language speakers of English</title>
<author>
<name sortKey="Coniam, D" sort="Coniam, D" uniqKey="Coniam D" first="D." last="Coniam">D. Coniam</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Hong Kong</country>
<wicri:regionArea>Faculty of Education, The Chinese University of Hong Kong, Sha Tin</wicri:regionArea>
<placeName>
<settlement type="city">Sha Tin</settlement>
<region type="Région">Hong Kong</region>
</placeName>
<orgName type="university">Université chinoise de Hong Kong</orgName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">System</title>
<title level="j" type="abbrev">SYS</title>
<idno type="ISSN">0346-251X</idno>
<imprint>
<publisher>ELSEVIER</publisher>
<date type="published" when="1998">1998</date>
<biblScope unit="volume">27</biblScope>
<biblScope unit="issue">1</biblScope>
<biblScope unit="page" from="49">49</biblScope>
<biblScope unit="page" to="64">64</biblScope>
</imprint>
<idno type="ISSN">0346-251X</idno>
</series>
<idno type="istex">9E60D439B771A434534991EDCEAEA54E8210303B</idno>
<idno type="DOI">10.1016/S0346-251X(98)00049-9</idno>
<idno type="PII">S0346-251X(98)00049-9</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0346-251X</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper explores the potential of the use of voice recognition technology with second language speakers of English. The study is a development of an earlier study conducted with a small group of native speakers (Coniam, 1998a, TEXT Technology 8.). The current study involves the analysis of the output produced by a small group of very competent second language subjects reading a text into the voice recognition software Dragon Systems ‘Dragon Naturally Speaking’. As the program is speaker-dependent and has to be trained to recognise each person's voice, subjects first spent about 45 minutes reading a training text of some 3800 words. As the test text, they then read a second text consisting of 1050 words. The output produced by the software was analysed in terms of words, sub-clausal units, clauses and t-units. In terms of accuracy, the second language speakers' output on each category of analysis was significantly lower than that achieved by the native speakers. Nonetheless, the results were consistent in line with the native speakers' scores; i.e. that the highest accuracy scores were achieved at the lowest (and most discrete) level of analysis, the word level, and the lowest scores at the t-unit, or sentence level of analysis. The paper concludes that voice recognition technology is still an at early stage of development in terms of accuracy and single-speaker dependency. Nonetheless, the fact that consistent results have emerged suggests that the development of an assessment tool, such as a reading aloud test via voice recognition technology and determining a score through an analysis of the output, may be a testing procedure with potential.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Hong Kong</li>
</country>
<region>
<li>Hong Kong</li>
</region>
<settlement>
<li>Sha Tin</li>
</settlement>
<orgName>
<li>Université chinoise de Hong Kong</li>
</orgName>
</list>
<tree>
<country name="Hong Kong">
<region name="Hong Kong">
<name sortKey="Coniam, D" sort="Coniam, D" uniqKey="Coniam D" first="D." last="Coniam">D. Coniam</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001E69 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001E69 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:9E60D439B771A434534991EDCEAEA54E8210303B
   |texte=   Voice recognition software accuracy with second language speakers of English
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024